Survey on Clustering High-Dimensional data using Hubness


Similar resources

A Study on Clustering High Dimensional Data Using Hubness Phenomenon

Data mining is the non-trivial process of extracting information from very large databases. In recent years, data repositories have come to hold high-dimensional data, which makes a complete search computationally infeasible in most data mining problems. To address this problem, clustering plays a vital role in handling both low-dimensional and high-dimensional data. Low dimensional data mak...


A Survey on Clustering High Dimensional Data Techniques

Cluster analysis divides data into groups. It was developed mainly for the purpose of summarization and improved understanding. Examples include grouping related documents for browsing, finding genes and proteins that have similar functionality, and serving as a means of data compression. The ter...


The Role Of Hubness in High-dimensional Data Analysis

Machine learning in intrinsically high-dimensional data is known to be challenging, and this is usually referred to as the curse of dimensionality. Designing machine learning methods that perform well in many dimensions is critical, since high-dimensional data arises often in practical applications; typical examples include textual, image and multimedia feature representations, as well as time...


Clustering High Dimensional Data Using SVM

The Web contains a massive number of documents from across the globe, to the point where it has become impossible to classify them manually. This project's goal is to find a new method for clustering documents that comes as close to human classification as possible while also reducing the size of the documents. This project uses a combination of Latent Semantic Indexing (LSI) with Singu...


High-dimensional data clustering

Clustering in high-dimensional spaces is a difficult problem that recurs in many domains, for example in image analysis. The difficulty arises because high-dimensional data usually live in different low-dimensional subspaces hidden in the original space. This paper presents a family of Gaussian mixture models designed for high-dimensional data which combine the ideas of subspace c...



Journal

Journal title: International Journal of Scientific Research in Computer Science, Engineering and Information Technology

Year: 2020

ISSN: 2456-3307

DOI: 10.32628/cseit195671